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ABSTRACT 

Many of the important conclusions about Gamma-Ray Bursts follow from 
the distributions of various quantities such as peak flux or duration. We show 
that for astrophysical transients such as bursts, multiple selection thresholds 
can lead to various forms of data truncation, which can strongly affect the 
distributions obtained from the data if not accounted for properly. Thus the 
data should be considered to form a multivariate distribution. We also caution 
that if the variables forming the multivariate distribution are not statistically 
independent of each other, further biases can result. A general method is 
described to properly account for these effects, and as a specific example we 
extract the distributions of flux and duration from the BATSE 3B Gamma- Ray 
Burst data. It is shown that properly accounting for the aforementioned biases 
tends to increase the slope of the log A-log S relation at low values of S, and 
dramatically increases the number of short duration bursts. 

Subject headings: gamma rays: bursts 

1. Introduction 

The unusual nature of Gamma-Ray Bursts (GRBs) has provided many challenges 
for astronomers in the design of instruments and observations for their detection, in 
analysis of the data, and in theoretical modeling. The primary sources of these difficulties 
are their extreme variability, short durations (< I0 3 s), varied "light curves," and the 
apparent absence of any corresponding lower energy photons. Consequently there exist no 
simultaneous observations at other wavelengths, especially those accessible to ground based 
observatories which could allow identification of counterparts and provide knowledge of their 
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distances, luminosities, and spatial distributions. As is the case in all other astronomical 
situations, the absence of direct knowledge of distance means that we must rely on the 
so-called log iV-log S distribution to infer the combined spatial and luminosity distribution 



For steady sources, with S standing for the flux / = L/Airr 2 , the number of sources at 
flux / is related to the luminosity function $ as 



where V(r) is the volume (co-moving for cosmological sources) of space occupied up to 
distance r (the luminosity distance for cosmological sources). From comparisons of n(f) 
with observations one hopes to determine the luminosity function and the geometry of the 
space occupied by the sources. This of course cannot be done uniquely from the log iV-log S 
analysis alone, and we must rely on other information for a complete picture. Nevertheless 
it is clear that the log N- log S distribution is an important tool for the study of GRBs, and 
it is imperative that it be determined accurately from observations. 

For GRBs, and variable sources in general, care is required for an accurate and 
bias-free determination of their distributions. When a source can vary over a time scale 
shorter than the length of accumulation of data, there is the uncertainty whether to use 
peak flux or total (time integrated) flux (i.e. fluence) for the parameter S. GRBs, with 
durations ranging from 10~ 2 to f 3 s, clearly fall into this category and the earlier analyses 
of their log iV-log S relation suffered from this uncertainty. This difficulty was somewhat 
alleviated by analyses of the data in terms of c = Cp/Cu m (or equivalently in terms of the 
so-called V/V max = c~ 3 / 2 ), where Cp is the peak count rate (averaged over a predetermined 
trigger interval At) and Cu m is the variable threshold rate (which also depends on At). 
Unfortunately, as pointed out by Petrosian (1993; hereafter PI) the distribution of the 
ratio c, aside from being capable of testing the simplest model (a homogeneous, isotropic 
distribution of sources in a static, Euclidean geometry, HISE for short), is of little use in 
the determination of $ or V(r) in equation (1). As shown in PI, the distribution of c is 
related in a more complicated way to $ and V, involving a convolution of equation (1) with 
the distribution of Cu m . It was also shown in PI that such a complication is unnecessary; 
the distribution of Cp (or the peak flux fp) can be obtained directly from the observed 
bivariate distributions of Cp and C; 8m , with correct accounting of the bias introduced due 
to variable Cu m . 

The variability in the threshold rate and the finite triggering interval At introduce 
additional complications and selection effects which need to be considered. Many 
instruments, including BATSE on the Compton Gamma-Ray Observatory, define a burst to 
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be occurring when the average photon count rate 

rt+At 

C(t) = J C{t)dt/At (2) 

exceeds the threshold rate Cu m . Bursts are characterized by Cu m and the maximum 
of this rate Cp = max[C(t)]. BATSE uses three trigger intervals At = 64, 256 and 
1024 ms, with corresponding values of Cu m determined for each value of At. The bias 
introduced by this process, even if Cu m were constant, occurs because bursts could have 
true durations T larger or shorter than At. As pointed out by Petrosian, Lee & Azzam 
(1994; hereafter PII), for T 3> At the average Cp is a good representation of the true peak 
rate Cp = max[C(t)], but for bursts with T <C At the average rate Cp is proportional to the 
total count fluence Tc = J T Cdt as Cp = Tel At and clearly is an underestimation of the 
true Cp, with the degree of underestimation increasing with decreasing T/ At. Therefore 
the observed distribution must be characterized by at least three variables: Cp, C; 8m , and 
T. Furthermore, it is also clear that the above bias depends on the shape of the light curve. 
For example, bursts with T 3> At which have a spiky peak with a time scale shorter than 
At also suffer from the above bias. 

Thus in reality the task at hand is more complicated. We need to extract the 
univariate distributions n(Cp), g(T), etc. from a multivariate observed distribution 
ip(Cp, Cu m} T, Q!j-, . . .), where a 8 - are parameters characterizing the shape of C(t). Care 
is necessary in such an extraction because the data is truncated, sometimes simply but 
sometimes in a complicated way, by the triggering and other observational selection effects, 
and because the variables of these multivariate distributions may not be independent of 
each other (they might be correlated). 

In §2 we describe the difficulties that arise because of these effects and describe 
methods that we have developed to overcome them and obtain accurate distributions. 
Applications of these methods to the data from the BATSE 3B catalog are described in 
§3, where we derive the distributions of peak flux and duration. As already was evident 
from our preliminary application of these methods to the IB catalog in PII, we find that 
ignoring these complications gives misleading results. Finally in §4 we present a summary 
and discussion of our results. 

Note that usually this analysis is done in terms of the cumulative distribution 

/•OO /"OO //T/ /"CO 

N(f)= n(f')df'= —dr $(L } r)dL. (3) 

J / JO dr J4irr 2 j 

The differential distribution with its independent points is preferred. We treat the problem 
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in terms of the differential distribution or the logarithmic slope 

s(f) __dJ^N_Mfl (4) 

Uj " dlog/-JV(/)' W 

which in spite of its appearance represents a set of independent slopes. Further details on 
this aspect of the problem have been presented by Efron & Petrosian (1992) and Azzam & 
Petrosian (1996). 



2. Obtaining Univariate Distributions From Multivariate Data 

The problems mentioned above associated with analysis of GRB data, and in many 
other astronomical data analyses, can be reduced to the generic case of the determination of 
a bias-free univariate distribution from a multivariate distribution function. To simplify the 
discussion in what follows in this section we will consider the case where the multivariate 
distribution has only two variables and can be written as ip(x,y). The generalization to the 
multivariate situation only adds computational complexity with little conceptual difficulty. 
Furthermore, let us assume that ip is separable, such that 

^{x,y) = f{x)g{y). (5) 

In order to obtain univariate distributions which are unbiased, two major effects must be 
taken into account. The first is the effect of data truncation due to observational selection 
processes. The second is the correlation that could be present between the variables (x and 
y in this presentation). 



2.1. Effects of Truncation 

Because of the limited sensitivity of experimental or observational procedures, all data 
are essentially truncated by the sensitivity thresholds. Namely, the available data is limited 
to say x > xu m and y > yu m (the situation with upper limits can be treated similarly 
and will not be considered here). The simplest truncations, in which xu m and yu m are 
constants, pose no additional problem. Data with only these kinds of truncations parallel 
to the axes are sometimes referred to as untruncated. Difficulty arises when one (or both) 
of these thresholds is a function of the other variable. For example, if threshold xu m is 
constant but the detection threshold of y is a function of x, say yu m = h(x), then the data 
are called truncated and the distributions of observed X{ and (where i = 1, 2, ... n, and n 
is the total number of data points) are biased and do not represent the true distributions 
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f(x) = Jq 00 ipdy or g(y) = ipdx. Note that this general truncation can be reduced to the 
generic case by transformation of the variables x' — > h(x),y' — > y resulting in the generic 
truncation y' > x' . 

The GRB data suffers from at least two truncations. The well known one is the 
requirement that Cp > Cu m , arising from the variable threshold Cu m . The usual practice 
to account for this truncation has been to treat the data in terms of c = Cp/Cu m or 
V/V max = c~ 3 / 2 . This is equivalent to the transformations x" — > x and y" — s- y/h(x) in 
the above example, which reduces to the case of simple truncation parallel to the axes. As 
mentioned in §1, this treatment has a limited utility and is shown in PI to be unnecessary 
because one can account for this bias and obtain the distributions of Cp and Cu m directly. 
Another bias, the subject of PII, is due to the finite duration At of the burst trigger process. 
This gives rise to a peak photon count rate threshold which is a function of duration, i.e. to 
a truncation in the Cp-T plane. As also shown in PII, the effect of this truncation can also 
be accounted for by using the same methods used for the truncation due to the variation of 
Cum- We will return to these biases in the next section. 



2.2. Effects of Correlations 

The second major effect, which has not been recognized or acknowledged widely, is 
that even when the truncation is simple, i.e. parallel to the axes (xn m and yu m independent 
of y and x, respectively), the observed distributions of X{ and will be biased if x and y are 
correlated. For example, suppose that the average value of y varies with x as (y) = k(x), so 
that we can write 

^(x,y) = f(x)g(y/k(xj)/k(x). (6) 
It is then easy to show that for y > yu m the observed distribution of x is 

/•oo 

fobs{x)= tp(x,y)dy = f(x)G(yi im /k(x)), (7) 
Jyum 

where 

[•CO 

G{u) = / g(u')du', G(0) = 1, (8) 

J u 

which is in general different from the true distribution f(x). The difference between f(x) 
and fobs(x) decreases with decreasing dispersion in the values of x and y. Only in the 
case of perfect correlation with no dispersion (g(u) = S(u)) will f bs(x) be proportional to 
f(x). It is clear that a similar discrepancy will be present between the true and observed 
distributions of y. 
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In the case of GRBs, if Cp and Cu m (or Cp and T) are correlated we cannot ignore 
the distributions of Cu m or T in the determination of ra(Cp), which has been a ubiquitous 
practice in previous studies. Note that such correlations cannot be ignored even when 
dealing with the distribution of the ratio c = Cp/Cu m . Hence, the well-known deviation 
of the logarithmic slope of the cumulative distribution of c from -3/2 or the deviation 
of (V/V max ) from 0.5 does not necessarily indicate a deviation from HISE unless it is 
known that Cp and Cu m are uncorrelated with each other or T . All of the far-reaching 
consequences derived from the analysis of the distribution of c or V/V max are based on this 
seemingly untested assumption. 

Clearly both the effects of truncation and correlation must be taken into account when 
dealing with multivariate data that is truncated and its variables are correlated. When 
truncation is included in the above analysis, equation (7) becomes, upon replacement of 
yu m with h(x), 

Us(x) = f(x)G(h(x)/k(x)). (9) 

This replacement further differentiates between the observed and true distributions, unless 
the correlation form k(x) is known a priori and the truncation is chosen parallel to it, 
namely h(x) oc k(x). It should also be noted that if the degree of correlation between the 
variables and the distribution of one of the variables is known the true distribution of the 
other variable can be determined from the observations and equation (9). Unfortunately 
such a priori knowledge often does not exist, and generally this information must be 
deduced from the observed data itself. 



2.3. The Basic Non-Parametric Method 

There are many ways of obtaining univariate distributions from bivariate or multivariate 
data. Most rely on parametric fits to binned data. There are two problems with these 
methods, beyond those mentioned above in regards to truncation and correlation. First, 
a priori parameterization is arbitrary and can give misleading results. Second, binning 
amounts to smoothing the data, which often means that all the data cannot be used and 
results in loss of information on scales smaller than those used for the binning. Of course, in 
the presentation of results or when comparing with theoretical models it may be necessary 
and convenient to bin the results. However, this should be done only after all the data is 
utilized to its maximum extent. Therefore, a non-parametric analysis of the data and the 
avoidance of binning (or the choice of bin sizes which are smaller than the observational 
errors) are preferred. The method that we have developed and applied to GRB data in the 
past and for this paper is non-parametric and does not use any binning. 
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As shown in the previous two subsections, a correct procedure for the determination 
of the distributions requires first and foremost an accurate measure of the degree of 
correlation between the variables in a truncated data set. Standard correlation tests fail 
in the presence of truncations, which may introduce artificial correlations. In our method, 
both the determination of correlations and the extraction of the univariate distributions 
once the correlation is known rely on the concept of the associated set. 

Consider a set of data points i = 1, 2 ... n, ordered in y such that yi > 

which is truncated according to the relation yi > h(xi). For each data point, say there 
exists an associated set of points Aii which is defined to consist of all data points within 
the largest untruncated (i.e. simply truncated) region. These are points (xj,yj) with 
yj > yi and Xj such that h(xj) < yi } and lie in a box with sides parallel to the x and y 
axis (sometimes these sets are defined with y 3 > yi so that they include the data point yi 
in question). Clearly, we can define similar sets, say Mi associated with points Xi, where 
Mi = {(xj,yj) '■ Xj < Xi and y 3 > h(xi)}. Boxes containing sample associated sets Aii and 
Mi are shown in Figure 1 for the bivariate distribution of peak and threshold fluxes fp and 
flim as defined below for the BATSE 3B 1024 ms catalog. 

If the variables x and y are uncorrelated or are stochastically independent, then the 
various characteristics of the distributions can be expressed as functions of the number of 
points in the associated sets. In fact, for any given point the reciprocal of the number 
of points Mi in the associated set Aii is a direct measure of the logarithmic slope of the 
cumulative distribution G(y) at yi. It can be shown (for details see Efron & Petrosian 1992 
and Azzam & Petrosian 1995) that the logarithmic slope at yi is given by 

_ y_dG_ _ 8(y - y t ) 
SG ' t ~ Gdy ~ Mi + Q(y- yi y 

where 8 is the Dirac delta function, and is the Heaviside step function. It follows that 
the cumulative distribution G is obtained from the data as 

G(y l ) = G(y 1 )f[(l + M- 1 ), i > 1, (11) 

which is the relation obtained from the method first proposed by Lynden-Bell (1971) and 
later elaborated on by Woodroofe (1985) and Petrosian (1986). G(y\) is the value of G 
for the first point, which is unknown since by definition there are no data points with 
y > y\. It turns out that all non-parametric methods in the limit of one object per bin 
reduce to this simple method (for details see the review by Petrosian 1992). From equations 
(4), (10), and (11) we can obtain the differential distribution g(y) = G(yi)sa,i/yi- Note 
that independently of these distributions we can obtain the slope sp,i, the cumulative 
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distribution F(xi) = Jq' f(x)dx } and f(x{) at all values of X{ in terms of the unknown value 
of F(x\) at the lowest observed point. Whether the points are arranged in increasing or 
decreasing order is a matter of convenience or preference. 

The concept of the associated set is also used in the test for correlation. Since the 
truncation of the data can produce an artificial correlation between the observed values of X{ 
and a simple correlation test will lead to misleading results. Only correlations involving 
untruncated sets of points, i.e. associated sets, are real. Efron & Petrosian (1992) describe 
a test of independence based on rank ordering in the associated sets. One outcome of such 
a test would be a parametric determination of correlation, e.g. the function k(x) introduced 
above (see Lee, Petrosian, & McTiernan 1993 & 1995 for examples of this). Once k(x) is 
known, the transformation y' — > y/k(x) would lead to a new set of uncorrelated variables 
(x, y'), the distributions of which can be obtained from equations (4), (10), and (11). 

3. BATSE Flux-Duration Distributions 

The importance of the distribution of peak fluxes (photon count rates) has already 
been stressed in the introduction. The distribution of burst durations also gives important 
information about the nature of the sources. For example, analysis by the BATSE team 
(Kouvelieutou et al. 1993) of the raw distribution of durations for bursts in the IB catalog 
(without regard for the selection effects and potential biases due to correlations) indicates 
the possible existence of two populations of bursts, with the separation between these 
occurring at a burst duration T ~ 2 s. This separation is less clearly evident in the 3B 
catalog raw data. It is therefore important to determine how the effects we are considering 
here change this picture. Furthermore, the determination of the degree of correlation 
between the peak fluxes and durations of the GRBs is important not only for finding the 
individual distributions of these two characteristics but also for the so-called cosmological 
time dilation test. If the absolute peak luminosity of GRBs is constant (a very unlikely 
hypothesis considering the large dispersion in the pulse shapes and durations of GRBs), 
then a cosmological distribution of sources should show an anticorrelation between duration 
and peak flux. Weaker sources, being at higher redshifts, would be time dilated by larger 
amounts. Norris et al. (1994 & 1995) show that there exists such a relationship for a subset 
of GRBs observed by BATSE. These studies examined only a small fraction of the total 
number of bursts, for which the truncation effects were unimportant. The reality of this 
result and its interpretation as simple cosmological time dilation has been questioned (Band 
1994; Mitrofanov 1994; Fenimore et al. 1995). Using our methods, we may examine this 
relation for all GRBs for which fluxes, flux limits, and durations are known. This data set 
comprises ~ 50% of all bursts, as compared to the ~ 20% used in the Norris et al. studies. 
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As shown below and described in greater detail in another publication (Lee & Petrosian 
1996a), we cannot confirm the Norris et al. results with the publicly available BATSE 
catalog data. 

Before we proceed further we must define what we mean by the duration of a burst. It 
is well recognized that GRBs do not have well-defined pulse profiles. Light curves are as 
varied as the number of the bursts. In the distribution ip(Cp, Cu m , T, a 8 ) we would require 
many parameters a 8 - to describe all of the observed pulse shapes. Therefore, it is futile to 
attempt to solve this problem exactly. We are forced to average over all possible light curves 
and reduce the distribution to a trivariate one, ip(Cp,Cu m ,T). There are various ways one 
can define a pulse duration. The BATSE catalogs provide two duration estimates, T 90 and 
T 50 , which are the intervals of time containing 90% and 50% of the burst fluence. 

For simple pulses these and other definitions of duration are related to and are often 
proportional to each other and to the total duration or characteristic time scale of the pulse. 
For example, for a square pulse T x = T(x/100) and for a triangular (one or two sided) 
pulse T x = T(l — ^1 — x/100), where T is the total duration. Similarly, for an exponential 
pulse of time constant r we have T x = r In (100/(100 — x)). For more complicated pulses 
consisting of several overlapping or well separated spikes, T 90 could be a measure of the 
separation between spikes, while T 50 could be a measure of the duration of the main pulse 
of the burst. 



3.1. Data Truncation Effects 

The observed distributions of Cp, C/; m , and T are subject to two truncations. The first 
arises because Cp > C/; m , where Cu m is variable. The second, the existence of which and 
its effect were pointed out in PII, arises because of the finite duration of the trigger interval 
At. BATSE uses three values, 64, 256 and 1024 ms, for At. A burst occurs if the total 
instrument counts accumulated during one of these intervals exceeds the predetermined 
value of the threshold counts C/; mj At- The peak counts listed in the catalog are the largest 
values of 

rt+At 

F(t,At) = J C{t')dt' (12) 

for the triggered bursts. If we define an average count rate C/± t {t) = F(t, At)/ At (as in Eq. 
[2]), then the peak photon count rate Cp t At is the maximum of Ca*(^) (from now on Cp 
should be understood to implicitly mean Cp^t)- This however is not necessarily equal to 
the true peak rate Cp, which is the maximum of the light curve C(t). 

There are two effects which make Cp ^ Cp. The first and the more important 



- 10 - 



bias causes Cp < Cp for any burst which varies significantly over time scales less than 
At (namely short duration bursts or bursts with a dominant short duration spike). In 
particular, for bursts with total duration (T < At),CpAt is equal to the count fluence 
Tc = Jp C(t)dt of the burst. On the other hand, for bursts with durations (or time scales of 
the strongest spike) T 3> At, the average Cp is a good measure of the true peak rate Cp. 
For a simple pulse shape it is easy to see that Cp/Cp < 1, with the ratio decreasing rapidly 
for pulses with durations less than At. For example, for square, triangular and exponential 
pulses with duration or time constant T we have, respectively, 

C = \ Cp forT> At 

P \ CpT/At = Tel At for T < At K ' 

(Cp(l-At/2T) forT>At 

\ C P T/2At = Tel At for T < At y ' 

C P = Cp(1 - e- A ^)/(At/r) 

f C P (1 - At/2r) for r > At 
\ Cpr/At = Tel At forr<At 

The above expressions could be off slightly due to bin edge effects, with the maximum 
error a factor of two in the unfortunate case where exactly half of the fluence of a short 
(T < At) burst happens to be located exactly on the edge between two integration periods 
of time At. Also note that the second expression in each case shows that CpAt = Tc } the 
count fluence, and is not a measure of peak photon count rate. Thus, Cp as determined by 
BATSE is a measure of peak rate only for long duration and gradual events (those with 
time scales T > At), while it is a measure of the fluence for short duration bursts. For 
bursts with long durations but containing a dominant spike with duration T sp ik e < At, Cp 
is an underestimation of Cp. Such bursts could greatly complicate the issue. However, 
as we shall see below, the majority of bursts do not have this troublesome characteristic. 
Therefore, in any analysis of this data we should treat the short and long duration (relative 
to trigger time At) bursts separately or find a method to correct for this bias. From 
examination of equations (13), (14), and (15) we can see that a rough approximation to 
correcting for the bias would be to make the transformation Cp — > Cp } with 

C P fT + AV 



(15) 



Cp v r y ' < 16 > 

which to within a factor of less than 2 agrees with all three pulse shapes. Cp would then be 
our best estimate of the true peak count rate. 

We empirically test the above relation with the BATSE data. Since two of the BATSE 
trigger times At (64 ms and 1024 ms) are widely separated, for bursts with T or r > 64 ms, 
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Cp,64 will be a good measure of the true Cp so that the ratio Cp^/Cp.itm would provide 
a good estimate for the right hand side of equation (16) for At = 1024 ms. For very short 
bursts (T 90 < 64 ms), both the 64 ms and 1024 ms triggers give us a measure of the fluence 
Tc so that the ratio Cp^/Cp.itm - ► 16 and is no longer a measure of the right hand side 
of the above equation. The top panel of Figure 2 shows this ratio (which was called the 
variability by Lamb, Graziani, & Smith 1993) for the BATSE 3B data as a function of T 90 . 

As expected for T 90 > 1024 ms and T 90 < 64 ms this ratio tends towards the asymptotic 
values of 1 and 16 (though because of the very same bias there are very few bursts with 
T < 64 ms). For 64 < T 90 < 1024 ms the general trend of the data is very similar to the 
expression on the right hand side of equation (16), which is depicted by the solid line. The 
exact expression for the three simple pulse shapes (using the correct relation between T 90 
and T) are shown by the three dashed curves in Figure 2 (bottom panel). 

Aside from the relatively good agreement it should be noted that there are very few 
bursts in the upper right hand corner of this figure. Bursts which have a long overall 
duration but contain a sharp spike (with time scale < 1024 ms) would occupy this part of 
the diagram. As mentioned above such bursts would greatly complicate the bias correction. 
The absence of such bursts indicates that the dominant spike in a long duration burst 
tends to be broader than 1024 ms (C (t) / dC (t) / dt > 1024 ms near the peaks). However, 
the deviation from unity of the ratio at the long duration end indicates that this is not 
always true. Undoubtedly some of the vertical dispersion is due to random fluctuations and 
observational errors. In addition, some of the deviation from unity could be due to a second 
bias in the data, which in contrast to the above is important for slowly varying (not spiky) 
bursts. This bias is discussed by Lamb et al. (1993), who attribute it to C. Meegan. This 
so-called peak flux bias is a minor effect and arises from statistical fluctuations which could 
potentially give a peak count rate Cp > Cp for bursts with small Cp and long durations 
(T ^> At). In appendix A we discuss the peak flux bias in more detail and carry out some 
simulations to determine its effect. The bottom panel of Figure 2 shows the same data as in 
the top panel (dotted histogram), except that it has been median filtered in T 90 in groups 
of 3 bursts for the purpose of clarity. The solid histogram shows the Cp^/Cp.itm ratio 
"corrected" for the peak flux bias under the assumption of square pulses, which results in 
the maximum difference between the observed and true ratios. The theoretical values of 
the ratio for the pulse shapes given by equations ( 13)— ( 15) are also shown on this diagram. 
Given the uncertainties inherent in the pulse shapes of the bursts and the correction for the 
peak flux bias, we take equation (16) to be a good fit to the data and in what follows we 
use this to correct for the main bias discussed above. 
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3.2. Testing for Correlations 



Having determined the various truncations of the trivariate data, we can now proceed 
with the determination of the distributions of individual variables. As stressed in §2, the 
first step in the analysis of multivariate data is testing the data for any correlation between 
the variables Cp, Cu m and T. The BATSE catalog, in addition to providing Cp and C; 8m , 
also gives the photon flux fp 



for the energy range 50-300 keV, where A e ff(6, cf>) is the effective area of the detector for 
the direction 6, cf> of the bursts. Clearly the flux (versus count rate) threshold for a burst 
in a given direction is also given by a similar expression fu m = Cu m j A e j cf>) (see Caditz 
1995). Since in our method we directly take into account the truncation Cp < Cu m we can 
deal with the physically more meaningful fluxes which will be subject to the same kind of 
truncation fp > fn m - 

Figure 1 shows this bivariate distribution, which differs from the Cp-Cu m distribution 
by having the points moved diagonally parallel to the truncation line by a factor of A~jj on 
both axes. The use of fluxes rather than counts naturally increases the range and dispersion 
of the variables, but ensures that we are dealing with the more physical quantities. 

First we test the data for a correlation between fp and fn m - We expect that the 
threshold flux (determined from the background flux before a burst) will have no relation 
to the peak flux of a subsequent burst. Indeed our analysis using the method described in 
§ 2.3 shows no correlation, with the test results shown as the first entry in Table 1. This 
method (see Efron & Petrosian 1992 for complete details) evaluates a scalar test statistic 
t w , the magnitude of which is directly related to the probability P(t w ) = erfc(t tu /v / 2) that 
the observed data was drawn from an uncorrelated population. If \t w \ < 1.645, then the 
probability that the data was drawn from an uncorrelated population exceeds 10%, and one 
can say that the data show no evidence for correlation at the 90% confidence level. The 
value of t w we obtain for the 3B sample is 0.78, which corresponds to P(0.78) = 43.5%, 
confirming our expectation of no correlation. 

Next we need to determine the correlation between flux and duration from the trivariate 
distribution of /p, fn mi and T 90 . A two-dimensional representation of this distribution 
is presented in Figure 3, where the distribution of bursts projected into the T 90 -fp plane 
is shown with different plot symbols represent the different values of fn m - As described 
in §3.1, there is a bias against the detection of short duration (T 90 < At) bursts. This 
bias is evident from the paucity of points in the upper left portion of the figure. Bursts 
which should have occupied this portion of the figure have moved down to the lower left 
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portion, and bursts which should have occupied the lower left portion have moved below 
the threshold rate and are not observed. This bias therefore tends to introduce an artificial 
positive correlation between T 90 and fp. 

Determination of the correlation between T 90 and fp for the whole sample of bursts 
is complicated because of the variability of fu m and the three dimensional nature of the 
data. However, if we limit ourselves to the subset of bursts with values of fp larger than 
the largest value of fu m (bursts above the solid line in Fig. 3) the problem is reduced to 
two dimensions. Following the procedure described above, we find that there is indeed a 
significant correlation between T 90 and fu m (second entry in Table 1). 

However, we are interested in the correlation between the duration and the true peak 
flux fp. For this we use equation (16) to obtain our estimate of this true fp: 



Note that this correction can be applied to the ^1024 but not to the fp^4 because we have 
no measurements at trigger times At < 64 ms. We must now determine the correlation 
between T 90 and fp and their distributions from the data set consisting of fp } fn mi and 
T 90 . Figure 4 shows a two-dimensional representation of this data. The solid line, which 
is the horizontal line in Figure 3 transformed according to equation (18), shows that the 
short duration bursts with low values of fp are truncated. Using the methods described 
in connection with Figure 1 we can test for the independence of fp and T 90 using this 
truncated data. Again limiting our analysis to the two-dimensional case and to the sample 
of 318 bursts above the truncation line we find only marginally significant evidence for 
dependence (a small positive correlation indicated the third entry in Table 1). We see no 
sign of the anticorrelation expected for cosmological time dilation. This aspect is more 
completely described in another publication (Lee & Petrosian 1996a). 

In this paper we are concerned with the determination of bias-free univariate 
distributions of the true peak fluxes and durations. It is desirable to use the maximum 
possible number of data points so that consideration of the three-dimensional nature of the 
distribution becomes necessary. 

The three-dimensional data can be converted to the generic two-dimensional case by 
the transformation of the variable T 90 to a new variable, which we call fii m (Tgo) } given by 
an expression identical to equation (18): 



Note that fii m (Tgo) should be considered a function of T 90 and not a true threshold. 
The instrumental threshold fu m is a true threshold and is not expected to be correlated 
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with anything at all. The limiting flux fu m ,i(Tgo,i) for burst i is the minimum value its 
true peak flux fp^ must have had so that it would have triggered under the criterion 
fp t i > fiim,i- With this transformation we have now a bivariate distribution of fp and 
/;im(2go), with the generic truncation fp > fii m (Tgo). Figure 5 shows this distribution. Just 
as in the transformation from (Cp } Cu m ) to (fp,fu m ), the transformation from (fp,fu m ) 
to (fp,fu m (Tgo)) amounts to sliding the data points parallel to the truncation line, this 
time by an amount equal to 1 + At/T 90 . Clearly, long duration bursts (T 90 > At) are not 
affected by this transformation, but for short duration bursts fp and fu m {Tgo) increase, 
which increases the range of the variables. Even though fp and fu m are independent, fp 
and fiim(Tgo) may be correlated. Such a correlation would reflect any correlation that may 
exist between fp and T 90 . As shown by the fourth entry in Table 1, we find a statistically 
insignificant anticorrelation between fp and fu m {Tgo) consistent with the small correlation 
found above between fp and T 90 using the limited sample. This result will be used in the 
determination of the univariate distribution of the true peak flux fp. 

Before moving on, we note that a similar transformation may be performed on the 
data to aid determination of the distribution of T 90 . Instead of transforming the duration, 
we transform the peak flux via the inverse of the transformation given by equation (19), 

T Um (f P ) = At(fp/f Hm - I)" 1 , (20) 

so that the problem is again reduced to the generic case of a bivariate distribution with 
T 90 > Tn m (fp). Again the limiting duration Tu m ^(fp^) of burst i is the minimum value 
its duration T 9 o,; must have had so that it would have been triggered, i.e. it would have 
fp t i > fiim,i- The T 90 -Tn m distribution is shown in Figure 6. The result of the independence 
test (fifth entry in Table 1) shows an anticorrelation between T 90 and Tn m (fp) } which in 
its sense is consistent with the above results. However, the strength of the correlation is 
slightly stronger than what was found in the previous tests. The difference in strength may 
be due to a weak but complicated correlation between fp and T 90 , so that the correlation 
between these quantities is affected by the transformations given by equations (19) and 
(20). In particular, the range of Tn m (fp) for bursts with T 90 > At is very sensitive to 
the functional form chosen in equation (16) and utilized in equations (19) and (20). For 
example, if we had chosen the form appropriate for the square pulse (see Fig. 2), all the 
points in Figure 6 with Tn m (fp) > At would collapse on the line Tn m (fp) = At, while the 
points in Figure 5 would be essentially unaffected. 

We therefore conclude that there seems to be a marginally significant correlation 
between fp and T 90 which most likely should not greatly affect the determination of the 
univariate distributions. 
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3.3. Distribution of Peak Fluxes 

From the various bivariate distributions described above we can now find univariate 
distributions. We first discuss the distribution of peak fluxes. We present not only the 
cumulative distribution customarily used in the analysis of GRB data, but also a useful form 
of the differential distribution and the variation of logarithmic slope defined in equation (4). 
As is the practice in the analysis of radio source counts, we do not present the differential 

5 /2 

distribution n(fp) defined by equation (1); instead, we present f P ra(/p), which shows the 

— 5/2 

deviation from HISE, in which the expected distribution is proportional to f P . Figure 
7 shows the results. In each case the solid lines show the distributions of the peak fluxes 
fp without consideration of the truncations. The dotted histograms show when the effects 
of the duration bias and variable threshold are included via application of the method 
associated with Figure 5. 

Note that, as expected, the truncations tend to discriminate against the detection of 
weaker (lower peak flux) bursts, so that when properly accounted for the numbers of weak 
bursts is increased and the slope becomes larger (steeper N(fp) or n(fp)). We have not 
included the effect of the small correlation which we found in the fp-fu m (Tgo) distribution of 
Figure 5. Inclusion of this effect adds a qualitatively similar but quantitatively insignificant 
correction. The steepening of the corrected distributions must be taken into account when 
comparing the data with models. 

3.4. Distribution of Durations 

The first analysis of the duration distribution of BATSE data was carried out by 
Kouvelioutou et al. (1993), who plotted distributions for all bursts with known values of 
T 90 and T 50 in the IB catalog. These distributions showed the possible existence of two 
populations of bursts: short duration and long duration, with the division at T ~ 2 s. This 
analysis was done without regard for the various data truncations, nor were the effects of 
possible correlations between peak count rate Cp and duration acknowledged or considered. 
These truncations and correlations affect the distribution of durations as they do the 
distribution of fluxes. In paper PII we showed that when the first two effects are properly 
taken into account the number of bursts with short durations is enhanced considerably, 
making the distinction between the two populations even more striking. We have repeated 
this analysis for the 3B catalog, now also including the effects of the correlation between 
flux and duration. Figure 8 shows our results. It turns out that the new bursts in the 
3B-1B catalog, when limited to bursts with Cp > Cu m , do not show as strong evidence 
for two populations as the IB, so that in the total 3B catalog this distinction is somewhat 
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diminished and is barely visible in the raw data, shown as the solid histogram. 

However, when we include the effect of the duration bias, we obtain the dotted 
histogram which clearly shows a significantly larger population below T 90 < 2 s. Recall 
that in the analysis of this data we found a stronger correlation between Tn m (fp) and T 90 
than we would have expected considering the weak correlation between their counterparts 
fp oc (1 + At/T 90 ) and fii m (T 9 o) oc (1 + At/T 90 ). We suspect this difference is due to the 
form of these transformations, where small fluctuations in the fluxes can produce large 
fluctuations in the durations when the latter are greater than At = 1.024 s. Nevertheless, 
since the T 90 -Tn m (fp) distribution shown in Figure 6 shows a significant correlation, the 
simple application of the methods of §3.3 will not be correct. We must find a set of variables 
which are uncorrelated. Unfortunately this step cannot be carried out non-parametrically 
and we must assume a correlation form (i.e. a parametric transformation of the variables). 
A simple description of this process can be found in Lee et al. (1993 and 1995). Since 
we are interested in the distribution of T 90 we transform the variable Tu m to a new one 

= Tu m T$Q and determine the correlation test result t w as a function of cap. We take 
the value of cap = 0.18±g;if for which t w = ± 1.645 as our best estimate for the degree 
of correlation between T 90 and Tu m . Using ctp = 0.18, we transform into the T 90 -T{ im 
plane. Since the variables are now uncorrelated we apply the method of §3.3 to obtain the 
distribution shown by the dashed histogram of Figure 8, which shows an even stronger peak 
in the distribution at short durations. 

As mentioned above, because of the uncertainty in the transformation this last result 
is subject to error. To determine the degree of uncertainty we have used two other 
transformations, one appropriate for a square pulse (Eq. [13]) and one for a spiky pulse 
fit) = fp(l — \t/T\ n ) (note that the case n = 1 applies for a triangular pulse). We find 
that for the square pulse, t w = —3.36, while for the spiky pulses t w = —1.22, —1.58, —1.77 
for n = 1,2,3. The calculated correlations as measured by ctp vary by up to a factor of 
two from the value calculated above which was used to produce the figure, leading to 
distributions that are uncertain by this amount at short durations. Although the magnitude 
of the correction is fairly uncertain, application of the correlation correction can only 
increase the number of short duration bursts relative to the uncorrected case. 

4. Discussion and Summary 

None of our conclusions change significantly if we use T 50 rather than T 90 as our 
estimate of the duration, or if we use one of the other time scales (64 ms or 256 ms) for 
fp (note that for At = 64 ms we cannot test the validity of Eq. [16] because there exists 
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no time scale below 64 ms). We have chosen to present the 1024 ms results because the 



trigger sensitivity goes as y At, resulting in a larger sample. In this study we have used T 90 

rather than T 50 so that our results may be directly compared with previous studies (e.g. 
Kouvelioutou et al. 1993, Norris et al. 1994), which typically use T 90 or some variation of 
it. In addition, as a third measure of duration we take advantage of the fact that the energy 
fluences Te are tabulated in the BATSE catalog. We define the effective duration 



where (hv) is the average energy per photon, which for each burst can be deduced from the 
published hardness ratios with an assumption about the form of the spectra (we assumed 
that they could be described as power laws). Qualitatively, the distributions derived using 
T e ff or T 50 are very similar to those derived using T 90 . One of the few notable differences is 
that the univariate T e jj and T 50 distributions corresponding to Figure 8 extend over slightly 
less dynamic range, as expected due to the definitions of these durations. In addition, the 
correlations are somewhat weaker when using these measures of duration instead of T 90 . 
The implications of these small differences are discussed more fully in another work (Lee & 
Petrosian 1996a). 

Our results show the presence of a significantly larger population of short duration 
bursts than observed by BATSE. The question then arises if there exist two distinct 
populations of GRBs. As further evidence for this we present Figure 9 in which we plot 
the individual cumulative peak flux distributions for 8 different duration groups, each 
containing 65 sources. There does not appear to be any systematic variation in slope for the 
top six duration bins (confirming the results of § 3.2), but there seems to be a significant 
increase in the slope of these histograms for the last two bins where T 90 < 4 s. In particular, 
the value of the slope of the last bin (T 90 < 1 s), —1.3, is not significantly different from the 
value of —1.5 expected from HISE. One possible explanation for this difference would be 
that if there is indeed a real flattening of the distribution at low peak fluxes, the duration 
bias (which increases with decreasing duration) reduces the observed fluxes enough to 
truncate those points that would have shown the flattening. However, for the top seven 
duration bins combined, the slope of the distribution of ^1024 with the data truncated as 
i*p,i024 > 1 ph cm -2 s -1 (approximately the range covered by the eighth duration bin) is 



The same difference in the last two bins is also present in the distributions of fp and 
Cp. However, as mentioned in § 1, for short duration GRBs (those with T < At), the 
triggering of BATSE and other instruments is based on the photon fluence T , not the 
peak photon count rate. This fluence must exceed the threshold Tn m = jn m At. Therefore, 
for these bursts the only bias introduced is due to the variability of Tn m so that we can 






f P (huy 



0.99. 
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obtain a reliable log iV-log T histogram, which of course would be identical in shape to the 
log N- log fp histogram since fp = Tj At. The intriguing result presented above is that for 
such bursts where the complication due to light curves plays no role the analysis does not 
show a very significant deviation from HISE. 

This result could lead us to believe that perhaps the peak flux fp or photon count 
rate Cp is not a good measure of the basic energy release (and therefore distance) of 
GRBs. Indeed, the wide dispersion in duration and the multiplicity and complexity of light 
curve shapes make it unlikely that the peak of the largest spike could have much physical 
significance. Other characteristics such as the fluence of individual spikes, the total fluence, 
the duration of individual spikes, or the number and separation of the spikes may be more 
robust characteristics. If fluence turns out to be such a characteristic, then our result about 
the distribution of fluence of short duration bursts indicates that there exists the distinct 
possibility that short duration bursts are weak and nearby (distance less than several 
hundred parsecs) and therefore very nearly isotropic. It would be helpful if similar robust 
results can be obtained for longer duration bursts. We defer further discussion of these and 
other issues concerning the fluence to another publication (Lee & Petrosian 1996b). 

To summarize, the results of this paper are: 

1. We have emphasized that the data describing GRBs forms a multivariate distribution. 
Typically, it is desirable to extract univariate distributions from the multivariate 
distribution in order to compare the observations with theory. The multivariate 
nature of the distribution is important because the highly variable nature of GRBs 
and the procedures followed for their detection by instruments such as BATSE lead 
to complex data truncations in multivariable space. Hence, the distribution of each 
variable depends on the other variables, so that no single variable can be considered 
by itself. If the truncation effects are not properly accounted for, biased and incorrect 
univariate distributions can result. 

2. Biases in the distributions can be a result of not only data truncation, but also 
correlations between the variables in the multivariate distribution. We have presented 
methods to extract univariate distributions in the presence of both truncation and 
correlations. 

3. We have applied these methods to the BATSE 3B data set to extract the distribution 
of peak flux (the so-called log AMog S relation) and duration of the bursts from the 
multivariate distribution. We find that there is little evidence for any correlation 
between the flux and duration. If anything, the correlation is in the opposite sense 
of that required by cosmological time dilation and does not support the conclusions 
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of Norris et al. (1994 & 1995). When corrected for the various truncations and 
correlations, the duration distribution shows many more short duration events than 
shown in the distribution published by Kouvelioutou et al. (1993). 

4. We show that the log iV-log S relation gets increasingly steeper as the corrections for 
the various biases are applied. We find that for short duration bursts, the slope of 
the log iV-log S relation does not differ significantly from that predicted by HISE. We 
offer a possible explanation that the short duration bursts form a separate population 
of local isotropically distributed sources. 

This work has benefited from discussions with B. Efron and C. Meegan. We also 
acknowledge support from NASA grants NAGW 2290 and NAG-5 2733. 
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A. 



Correction for the Peak Flux Bias 



The use of Cp^ and Cp t io24 as estimates of the peak count rate in the intervals At = 
64 ms and 1024 ms respectively introduces the peak flux bias: Cp^t is an upwardly biased 
estimator of the true peak count rate in the interval At. This bias can be understood by 
considering the limiting case of a burst with a flat time profile of count rate C and duration 
T. The measured Cp^t will differ from AtC because of Poisson fluctuations inherent in the 
measurements. Since Cp^t is defined to be the maximum number of counts in the interval 
At, for T 3> At it is likely that these rates will exceed C At because the distribution of 
Cp,At has a finite spread about CAt. Note that Cp^ need not occur within the interval 
during which Cp t io24 occurs. 



follows. For T < 1024 ms, the ratio of the number of samples at 64 ms to the number 
of samples at 1024 ms (1 sample) increases with increasing T, so that the bias should be 
negligible at T < 64 ms and should increase rapidly with T. For T 3> 1024 ms, the bias 
should increase very slowly with T because the ratio of the number of samples at 64 ms 
to the number of samples at 1024 ms is approximately constant (ss 16). The bias should 
depend not only on T but also on C, with weak bursts being more affected by the bias 
because Poisson fluctuations in the background are relatively more important for these 
bursts. If we consider a constant count rate burst with count rate C, background count rate 
B, and duration T, sampled at intervals At, the number of samples is given by N = T / At 
and the true maximum total counts is // = (C + B) At. Because the actual measured counts 
obey Poisson statistics, there will be some spread in the distribution of the N measured 
counts so that the maximum counts likely will exceed //. The distribution of measured 
counts c is given by P(c, /i)T/At, where P is the Poisson distribution. The maximum 
expected measured counts c max is given by 



Because in general // ^> 1 (backgrounds alone are on the order of 10 2 counts in 64 ms), 
to a good approximation a Gaussian distribution may be substituted for the Poisson 
distribution, so that 



A.l. 



Estimates of the Bias 



We may estimate the effect of the peak flux bias on the ratio v = Cp^/Cp.itm as 






(A2) 
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where a = or 



so that 



2At erfc ( Cmax ^ ) , (A3) 



T V a 



(C + B)At + yJ(C + B)At erfc" 1 ( ^ ) . (A4) 



excess due to bias 

The ratio v = Cp^j 'Cp t io24 is thus given by 

1.024 c max (At = 0.064) - 0.0645 



0.064 c max (At = 1.024) - 1.0245 

0.064C + a/0.064(C + B) erfc" 1 
16 V , = T-t—r- (A5) 



0.128 



1.024C + Jl.024(C + B) erfc" 1 



2.048 



It can be seen that v exceeds v true = 1 because of the excess in both the numerator and the 
denominator due to the bias. The ratio of v to v true depends on T, C, and B. For a given 
background rate and duration, it is evident that the larger bursts will be less affected by the 
excess due to the bias, because for C — > oo, v — > v true . Large backgrounds tend to increase 
the bias, as do long durations. Table 2 lists some values of v/v true for several representative 
values of C, T, and B. It can be seen that the corrections due to the bias range from a few 
percent to up to a factor of two, depending on the parameters. 



A. 2. Bias Simulation 

To investigate the peak flux bias more thoroughly, we performed a simulation to find 
the ratio v = Cp^j 'Cp t io24 f° r constant count rate bursts of varying C and T. The bias is 
maximized for bursts of constant count rate and high background, so we chose an expected 
background count rate of B = 4000 counts/sec, which is close to the maximum background 
rate listed in the BATSE catalog. For a given C and T > At, we Poisson deviate the 
expected counts At(C + B) for each of n intervals of time At until (n + l)At > T . For 
instances where 64 ms < T < 1024 ms (or when At does not divide evenly into T), there 
will be m 64 ms intervals with zero expected excess counts. For these m intervals, we 
Poisson deviate the background counts. From the n + m intervals, the one with the largest 
number of counts is defined to be (Cp t At + B)At and Cp^t is readily found and the ratio v 
is found function of Cp 1024 and T. 

To approximately correct the ratio v derived from the BATSE data for the peak flux 
bias, we use the results of our simulation. For each burst, we use Cp t io24 an d T 90 to find the 
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associated v in our simulation. Since we know what v true is for a given T, we use the results 
of the simulation to map the observed v to v true . We note that strictly it is only possible 
to map forward from v true to v due to the variance caused by the Poisson fluctuations; 
thus a backward mapping from v to v true will only be approximately correct. Since we are 
interested not in accurately deriving v true for any individual burst but rather in determining 
gross changes in v true with T 90 , this approximation is reasonable for our purposes. We also 
note that we are assuming the maximal bias by using constant count rate bursts and a large 
background in our simulation. 

The results of this correction are shown in Figure 2. It can be seen that: 

1. As expected, the bias increases rapidly with T 90 until T 90 ^ Is. 

2. The maximum effect of the bias on v is approximately a factor of two as predicted in 
§A.l. 

3. The correction for the bias reduces the average value of v to a value slightly less than 
1 at large T 90 , indicating that the effect of the peak flux bias has been overestimated 
as predicted. We note that the corrected v versus T 90 curve fits the curve given by 
equation (16) remarkably well. 
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Test Burst Sample t w P(t w ) 

fp vs. fn m all available (518 bursts) 0.78 0.44 

fp vs. T 90 max sample with constant fu m (385 bursts) 4.17 3.0 X 10~ 5 

fp vs. T 90 max sample with constant fu m (385 bursts) 2.03 4.2 X 10~ 2 

fp vs. fiim(Tgo) all available (518 bursts) -1.49 0.14 

T 90 vs. Tn m (fp) all available (518 bursts) -2.76 5.8 x 10~ 3 



Table 1: Correlation test results for various pairs of variables. P(t w ) indicates the probability 
of obtaining the given value of t w from the distribution assuming uncorrelated variables. 
The sign of t w indicates the sense of the correlation (positive indicates correlation, negative 
indicates anticorrelation). 



c 


T 


B 


v/v t ,, 


(cts/sec) 


(sec) 


(cts/sec) 




1000 


100 


4000 


1.43 


1000 


10 


4000 


1.40 


1000 


1000 


4000 


1.52 


350 


100 


4000 


1.94 


10000 


100 


4000 


1.08 


1000 


100 


2500 


1.37 



Table 2: Values of the ratio of observed to true v = Cp^j 'Cp^^ for various choices of 
square pulse count rate C, duration T, and background count rate B. 




0.1 1 10 100 

average f P ,i 24 (photons/cm**2/s) 
Fig. 1. — The bivariate distribution of /pio24 an d fiim,i024- The diagonal line indicates the 
truncation limit /p,io24///im,io24 > 1- The boxes enclose the associated sets of a typical data 
point (/pio24 = 0.367, //i m) io24 = 0.229), which is circled in the figure. The short (long) 
dashed box contains the associated set for the point when considering the distribution along 
the horizontal (vertical) axis of fp (fu m )- The different plot symbols correspond to bursts 
of differing T 90 . Each plot symbol represents 12.5% of all bursts. In order of increasing 
T 90 : crosses, triangles, 4-pointed stars, circles, squares, 5-pointed stars, 3-pointed crosses, 
6-pointed stars. 




0.1 1 10 100 

T 90 (s) 




1000 



T 90 (s) 

Fig. 2. — top panel: The ratio Cp^/Cp.itm for the BATSE 3B data, bottom panel: The 
same ratio, but for clarity the data have been smoothed by a running median filter of 3 
points in T 90 . The dotted histogram shows the ratio obtained from the catalog data. The 
solid histogram shows the ratio corrected for the peak flux bias (see appendix A). The solid 
histogram does not extend as far as the dotted histogram because we are unable to use our 
correction procedure for bursts with T 90 < 128 ms. The smooth curves show the theoretical 
values of this ratio for a variety of pulse shapes, and the approximation of equation (16). 
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Fig. 3. — The bivariate distribution of average peak flux /p,io24 an d T 90 . The horizontal 
line indicates the highest level of // 8 ' m ,i024, above which the data is complete. The different 
plot symbols correspond to bursts of differing fu m . Each plot symbol represents 12.5% of all 
bursts, with the symbol order the same as in Figure 1. 




Fig. 4. — The bivariate distribution of true peak flux /p,io24 an d T 90 . The curve indicates 
the highest level of // 8 ' m ,i024, above which the data is complete. The different plot symbols 
correspond to bursts of differing fu m . Each plot symbol represents 12.5% of all bursts, with 
the symbol order the same as in Figure 1. 




Fig. 5. — The bivariate distribution of fp and fii m (Tgo). The different plot symbols represent 
different values of T 90 , with the ordering of the symbols is the same as in Figure 1. 




Fig. 6. — The bivariate distribution of Tn m (fp) and T 90 . The different plot symbols stand 
for different values of /p, with the ordering of the symbols the same as in Figure f . 
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Fig. 7. — Comparison of the cumulative distributions, differential distributions, and 
logarithmic slopes of the true peak flux (from top to bottom). The solid histograms are what 
would be found without consideration of the truncation. The dotted histograms are corrected 
for varying fii m (Tgo). The dot-dashed lines indicate the HISE prediction of logarithmic slopes 
of -1.5. For the middle graph, the vertical scale is arbitrary. 




Fig. 8. — Comparison of cumulative and differential distributions of T 90 (top and bottom 
panels respectively). The distributions show the effects of the corrections for (from bottom 
to top) nothing, varying T/ 8m , and correlation. 
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Fig. 9. — Individual cumulative distributions of /p,io24, in 8 groups corresponding to different 
ranges of T 90 , labeled by their logarithmic slopes. From bottom to top, the histograms 
correspond to groups of 65 bursts (12.5% of the total) in bins of increasing T 90 . The numbers 
represent the approximate logarithmic slopes of the distributions. For the sake of clarity, 
each histogram has been scaled by a factor between 1 and 10 7 . 



